As discussed infra with respect to consensus leukocyte interferon, those human leukocyte interferon subtypes having a threonine residue at position 14 and a methionine residue at position 16 are reputed to display greater antiviral activity than those subtypes possessing Ala¹⁴ and Ile¹⁶ residues. An analog of human leukocyte interferon subtype F was therefore manufactured by means of microbial expression of a DNA sequence of Example 7 which had been altered to specify threonine and methionine as residues 14 and 16, respectively. More specifically, [Thr¹⁴, Met¹⁶] IFN-αF, designated IFN-αF₂, was expressed in E. coli upon transformation with a vector of Example 7 which had been cut with SalI and HindIII and into which a modified subunit II (of Table VII) was inserted. The specific modifications of subunit II involved assembly with segment 39 altered to replace the alanine-specifying codon, GCT, with a threonine-specifying ACT codon and replace the isoleucine-specifying codon, ATT, with an ATG codon. Corresponding changes in complementary bases were made in section 40 of subunit LeuIFN-FII.

The following Examples 9 and 10 relate to practice of the invention in the microbial synthesis of consensus human leukocyte interferon polypeptides which can be designated as analogs of human leukocyte interferon subtype F.

EXAMPLE 9

"Consensus human leukocyte interferon" ("IFN-Con," "LeuIFN-Con") as employed herein shall mean a non-naturally-occurring polypeptide which predominantly includes those amino acid residues which are common to all naturally-occurring human leukocyte interferon subtype sequences and which includes, at one or more of those positions wherein there is no amino acid common to all subtypes, an amino acid which predominantly occurs at that position and in no event includes any amino acid residue which is not extant in that position in at least one naturally-occurring subtype. (For purposes of this definition, subtype A is positionally aligned with other subtypes and thus reveals a "missing" amino acid at position 44.) As so defined, a consensus human leukocyte interferon will ordinarily include all known common amino acid residues of all subtypes. It will be understood that the state of knowledge concerning naturally-occurring subtype sequences is continuously developing. New subtypes may be discovered which may destroy the "commonality" of a particular residue at a particular position. Polypeptides whose structures are predicted on the basis of a later-amended determination of commonality at one or more positions would remain within the definition because they would nonetheless predominantly include common amino acids and because those amino acids no longer held to be common would nonetheless quite likely represent the predominant amino acid at the given positions. Failure of a polypeptide to include either a common or predominant amino acid at any given position would not remove the molecule from the definition so long as the residue at the position occurred in at least one subtype. Polypeptides lacking one or more internal or terminal residues of consensus human leukocyte interferon or including internal or terminal residues having no counterpart in any subtype would be considered analogs of human consensus leukocyte interferon.

Published predicted amino acid sequences for eight cDNA-derived human leukocyte interferon subtypes were analyzed in the context of the identities of amino acids within the sequence of 166 residues. See, generally, Goeddel, et al., Nature, 290, pp. 20-26 (1981) comparing LeIFN-A through LeIFN-H and noting that only 79 amino acids appear in identical positions in all eight interferon forms and 99 amino acids appear in identical positions if the E subtype (deduced from a cDNA pseudogene) was ignored. Each of the remaining positions was analyzed for the relative frequency of occurrence of a given amino acid and, where a given amino acid appeared at the same position in at least five of the eight forms, it was designated as the predominant amino acid for that position. A "consensus" polypeptide sequence of 166 amino acids was plotted out and compared back to the eight individual sequences, resulting in the determination that LeIFN-F required few modifications from its "naturally-occurring" form to comply with the consensus sequence.

A program for construction of a manufactured IFN-Con DNA sequence was developed and is set out below in Table VIII. In the table, an asterisk designates the variations in IFN-αF needed to develop LeIFN-Con₁, i.e., to develop the [Arg²², Ala⁷⁶, Asp⁷⁸, Glu⁷⁹, Tyr⁸⁶, Tyr⁹⁰, Leu⁹⁶, Thr¹⁵⁶, Asn¹⁵⁷, Leu¹⁵⁸] analog of IFN-αF. The illustrated top strand sequence includes, wherever possible, codons noted to be the subject of preferential expression in E. coli. The sequence also includes bases providing recognition sites for Sal, HindIII, and BstE2 at positions intermediate the sequence and for XbaI and BamHI at its ends. The latter sites are selected for use in incorporation of the sequence in a pBR322 vector, as was the case with the sequence developed for IFN-αF and its analogs.
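The classification rule described for Example 9 (a residue shared by all eight subtypes is "common"; a residue found in at least five of the eight is "predominant") can be sketched in Python as follows. This is only an illustrative sketch, not the procedure actually used: the function name `consensus`, the `min_count` parameter, and the four-residue `toy` alignment are assumptions made for the example, and the toy strings are not real interferon sequences. The text does not say how positions with no predominant residue were resolved, so the sketch simply reports them as undetermined.

```python
from collections import Counter

def consensus(aligned, min_count=5):
    """Classify each column of equal-length, pre-aligned sequences.

    Returns a list of (residue, label) pairs, one per position, where
    label is "common" (same residue in every sequence), "predominant"
    (most frequent residue occurs at least min_count times), or
    "undetermined" (no residue reaches min_count; residue is None).
    """
    length = len(aligned[0])
    if any(len(seq) != length for seq in aligned):
        raise ValueError("sequences must be aligned to the same length")

    result = []
    for column in zip(*aligned):
        # Most frequent residue at this position and how often it occurs.
        residue, count = Counter(column).most_common(1)[0]
        if count == len(aligned):
            result.append((residue, "common"))
        elif count >= min_count:
            result.append((residue, "predominant"))
        else:
            result.append((None, "undetermined"))
    return result

# Hypothetical toy alignment of eight sequences (NOT real subtype data).
toy = ["MCDL", "MCDL", "MCDL", "MCNL", "MCDK", "MSNP", "MCDQ", "MCER"]

for position, (residue, label) in enumerate(consensus(toy), start=1):
    print(position, residue, label)
```

In the toy alignment the first position is common, the second and third are predominant (seven and five of eight, respectively), and the fourth has no residue reaching the five-of-eight threshold.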

TABLE VIII 

-1  1                                   10
Met-Cys-Asp-Leu-Pro-Gln-Thr-His-Ser-Leu-Gly-Asn-Arg-Arg-
ATG TGT GAT TTA CCT CAA ACT CAT TCT CTT GGT AAC CGT CGC

                        20      *
Ala-Leu-Ile-Leu-Leu-Ala-Gln-Met-Arg-Arg-Ile-Ser-Pro-Phe-
GCT CTG ATT CTG CTG GCA CAG ATG CGT CGT ATT TCC CCG TTT

        30                                      40
Ser-Cys-Leu-Lys-Asp-Arg-His-Asp-Phe-Gly-Phe-Pro-Gln-Glu-
AGC TGC CTG AAA GAC CGT CAC GAC TTC GGC TTT CCG CAA GAA

                                50
Glu-Phe-Asp-Gly-Asn-Gln-Phe-Gln-Lys-Ala-Gln-Ala-Ile-Ser-
GAG TTC GAT GGC AAC CAA TTC CAG AAA GCT CAG GCA ATC TCT

                60
Val-Leu-His-Glu-Met-Ile-Gln-Gln-Thr-Phe-Asn-Leu-Phe-Ser-
GTA CTG CAC GAA ATG ATC CAA CAG ACC TTC AAC CTG TTT TCC

70                      *       *   *   80
Thr-Lys-Asp-Ser-Ser-Ala-Ala-Trp-Asp-Glu-Ser-Leu-Leu-Glu-
ACT AAA GAC AGC TCT GCT GCT TGG GAC GAA AGC TTG CTG GAG

        *              *90                      *
Lys-Phe-Tyr-Thr-Glu-Leu-Tyr-Gln-Gln-Leu-Asn-Asp-Leu-Glu-
AAG TTC TAC ACT GAA CTG TAT CAG CAG CTG AAC GAC CTG GAA

        100                                     110
Ala-Cys-Val-Ile-Gln-Glu-Val-Gly-Val-Glu-Glu-Thr-Pro-Leu-
GCA TGC GTA ATC CAG GAA GTT GGT GTA GAA GAG ACT CCG CTG